Overview

Dataset statistics

Number of variables25
Number of observations14388
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.7 MiB
Average record size in memory200.0 B

Variable types

NUM20
CAT4
BOOL1

Reproduction

Analysis started2020-05-30 11:10:46.790176
Analysis finished2020-05-30 11:11:48.001110
Duration1 minute and 1.21 second
Versionpandas-profiling v2.8.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml

Warnings

intensity_std is highly correlated with intensity_max and 1 other fieldsHigh correlation
intensity_max is highly correlated with intensity_std and 1 other fieldsHigh correlation
intensity_sum is highly correlated with intensity_meanHigh correlation
intensity_mean is highly correlated with intensity_sumHigh correlation
intensity_amplitude_v is highly correlated with intensity_max and 1 other fieldsHigh correlation
params0 is highly skewed (γ1 = -22.49707985) Skewed
params1 is highly skewed (γ1 = 63.68227615) Skewed
params3 is highly skewed (γ1 = 68.59503099) Skewed
params4 is highly skewed (γ1 = 43.40645376) Skewed
spectrum_id has unique values Unique
spectrum_filename has unique values Unique
params0 has unique values Unique
rms has unique values Unique
intensity_std has unique values Unique
layout_x has 398 (2.8%) zeros Zeros

Variables

df_index
Real number (ℝ≥0)

Distinct count7436
Unique (%)51.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3600.570336391437
Minimum0
Maximum7435
Zeros2
Zeros (%)< 0.1%
Memory size112.4 KiB

Quantile statistics

Minimum0
5-th percentile359.35
Q11798
median3596.5
Q35395
95-th percentile6833.65
Maximum7435
Range7435
Interquartile range (IQR)3597

Descriptive statistics

Standard deviation2083.835443
Coefficient of variation (CV)0.5787514889
Kurtosis-1.184009281
Mean3600.570336
Median Absolute Deviation (MAD)1798.5
Skewness0.01161603806
Sum51805006
Variance4342370.154
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
20472< 0.1%
 
37312< 0.1%
 
14662< 0.1%
 
35152< 0.1%
 
55682< 0.1%
 
14742< 0.1%
 
35232< 0.1%
 
55762< 0.1%
 
14822< 0.1%
 
35312< 0.1%
 
Other values (7426)1436899.9%
 
ValueCountFrequency (%) 
02< 0.1%
 
12< 0.1%
 
22< 0.1%
 
32< 0.1%
 
42< 0.1%
 
ValueCountFrequency (%) 
74351< 0.1%
 
74341< 0.1%
 
74331< 0.1%
 
74321< 0.1%
 
74311< 0.1%
 

spectrum_id
Categorical

UNIQUE

Distinct count14388
Unique (%)100.0%
Missing0
Missing (%)0.0%
Memory size112.4 KiB
6224874a85b33616068e
 
1
4861966b440c8397c9aa
 
1
cb5f33b0bbc9da03dcb5
 
1
29d2c8fb47d43c31b82c
 
1
d05d592d1eae213b8897
 
1
Other values (14383)
14383
ValueCountFrequency (%) 
6224874a85b33616068e1< 0.1%
 
4861966b440c8397c9aa1< 0.1%
 
cb5f33b0bbc9da03dcb51< 0.1%
 
29d2c8fb47d43c31b82c1< 0.1%
 
d05d592d1eae213b88971< 0.1%
 
70fa103f6c85d7b496fa1< 0.1%
 
6d074e0d339e89db33de1< 0.1%
 
f3bbace8e48bacfa02b51< 0.1%
 
87c7b039da5e1334a2b01< 0.1%
 
19e14a629e47ee80f9d51< 0.1%
 
Other values (14378)1437899.9%
 

Length

Max length20
Median length20
Mean length20
Min length20

spectrum_filename
Categorical

UNIQUE

Distinct count14388
Unique (%)100.0%
Missing0
Missing (%)0.0%
Memory size112.4 KiB
8e0530950c922e0de50b.dat
 
1
3d3c6f1f5c6a2b5955f4.dat
 
1
fdf3f1e93f2a0d528e85.dat
 
1
b911242b49da94982d27.dat
 
1
61e2218ecb4393410ab2.dat
 
1
Other values (14383)
14383
ValueCountFrequency (%) 
8e0530950c922e0de50b.dat1< 0.1%
 
3d3c6f1f5c6a2b5955f4.dat1< 0.1%
 
fdf3f1e93f2a0d528e85.dat1< 0.1%
 
b911242b49da94982d27.dat1< 0.1%
 
61e2218ecb4393410ab2.dat1< 0.1%
 
2d85d0e687de1cd09330.dat1< 0.1%
 
48796c44ff13799e856f.dat1< 0.1%
 
94b5835b69618b4aff2f.dat1< 0.1%
 
7449f9142926a6240af8.dat1< 0.1%
 
be70b9343fcf8399d7a2.dat1< 0.1%
 
Other values (14378)1437899.9%
 

Length

Max length24
Median length24
Mean length24
Min length24

chip_id
Real number (ℝ≥0)

Distinct count9
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.9274395329441205
Minimum1
Maximum9
Zeros0
Zeros (%)0.0%
Memory size112.4 KiB

Quantile statistics

Minimum1
5-th percentile1
Q14
median6
Q38
95-th percentile9
Maximum9
Range8
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.763018332
Coefficient of variation (CV)0.4661402815
Kurtosis-1.081548837
Mean5.927439533
Median Absolute Deviation (MAD)2
Skewness-0.5437650292
Sum85284
Variance7.634270305
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
8389727.1%
 
9285319.8%
 
1182112.7%
 
6180512.5%
 
511948.3%
 
311488.0%
 
410087.0%
 
24603.2%
 
72021.4%
 
ValueCountFrequency (%) 
1182112.7%
 
24603.2%
 
311488.0%
 
410087.0%
 
511948.3%
 
ValueCountFrequency (%) 
9285319.8%
 
8389727.1%
 
72021.4%
 
6180512.5%
 
511948.3%
 

exc_wl
Categorical

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size112.4 KiB
1
8466
2
5922
ValueCountFrequency (%) 
1846658.8%
 
2592241.2%
 

Length

Max length1
Median length1
Mean length1
Min length1

layout_a
Categorical

Distinct count4
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size112.4 KiB
1
3749
3
3712
2
3585
4
3342
ValueCountFrequency (%) 
1374926.1%
 
3371225.8%
 
2358524.9%
 
4334223.2%
 

Length

Max length1
Median length1
Mean length1
Min length1

layout_x
Real number (ℝ≥0)

ZEROS

Distinct count48
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean23.49548234639978
Minimum0
Maximum47
Zeros398
Zeros (%)2.8%
Memory size112.4 KiB

Quantile statistics

Minimum0
5-th percentile1
Q110
median24
Q337
95-th percentile45
Maximum47
Range47
Interquartile range (IQR)27

Descriptive statistics

Standard deviation14.55929055
Coefficient of variation (CV)0.6196634031
Kurtosis-1.329048378
Mean23.49548235
Median Absolute Deviation (MAD)13
Skewness-0.0514120973
Sum338053
Variance211.9729413
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
03982.8%
 
13902.7%
 
43872.7%
 
33742.6%
 
383622.5%
 
333582.5%
 
463522.4%
 
403482.4%
 
443482.4%
 
23482.4%
 
Other values (38)1072374.5%
 
ValueCountFrequency (%) 
03982.8%
 
13902.7%
 
23482.4%
 
33742.6%
 
43872.7%
 
ValueCountFrequency (%) 
472631.8%
 
463522.4%
 
453362.3%
 
443482.4%
 
433052.1%
 

layout_y
Real number (ℝ≥0)

Distinct count192
Unique (%)1.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean97.44683069224354
Minimum0
Maximum191
Zeros56
Zeros (%)0.4%
Memory size112.4 KiB

Quantile statistics

Minimum0
5-th percentile8
Q146
median99
Q3148
95-th percentile183
Maximum191
Range191
Interquartile range (IQR)102

Descriptive statistics

Standard deviation57.22705902
Coefficient of variation (CV)0.5872644458
Kurtosis-1.271339976
Mean97.44683069
Median Absolute Deviation (MAD)51
Skewness-0.05461754493
Sum1402065
Variance3274.936285
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1431110.8%
 
71110.8%
 
1811020.7%
 
1791010.7%
 
1861010.7%
 
187990.7%
 
50980.7%
 
6980.7%
 
171980.7%
 
147970.7%
 
Other values (182)1337292.9%
 
ValueCountFrequency (%) 
0560.4%
 
1820.6%
 
2800.6%
 
3910.6%
 
4610.4%
 
ValueCountFrequency (%) 
191820.6%
 
190860.6%
 
189860.6%
 
188730.5%
 
187990.7%
 

pos_x
Real number (ℝ)

Distinct count14295
Unique (%)99.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.877409570475395
Minimum-1704.704
Maximum1698.19
Zeros0
Zeros (%)0.0%
Memory size112.4 KiB

Quantile statistics

Minimum-1704.704
5-th percentile-1606.56925
Q1-972.04915
median187.43875
Q31052.58025
95-th percentile1569.99925
Maximum1698.19
Range3402.894
Interquartile range (IQR)2024.6294

Descriptive statistics

Standard deviation1092.881469
Coefficient of variation (CV)185.9461138
Kurtosis-1.420822706
Mean5.87740957
Median Absolute Deviation (MAD)995.47275
Skewness-0.03974500368
Sum84564.1689
Variance1194389.905
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1344.0892< 0.1%
 
1055.5942< 0.1%
 
1533.172< 0.1%
 
-1696.6222< 0.1%
 
1314.5542< 0.1%
 
1281.5762< 0.1%
 
-1149.8542< 0.1%
 
-1508.1842< 0.1%
 
1569.8622< 0.1%
 
-1667.8312< 0.1%
 
Other values (14285)1436899.9%
 
ValueCountFrequency (%) 
-1704.7041< 0.1%
 
-1704.2221< 0.1%
 
-1704.0371< 0.1%
 
-1704.0051< 0.1%
 
-1702.7771< 0.1%
 
ValueCountFrequency (%) 
1698.191< 0.1%
 
1693.6471< 0.1%
 
1688.861< 0.1%
 
1669.7041< 0.1%
 
1669.5881< 0.1%
 

params0
Real number (ℝ)

SKEWED
UNIQUE

Distinct count14388
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-49.48396538142775
Minimum-162480.31288668307
Maximum2850.6578511143766
Zeros0
Zeros (%)0.0%
Memory size112.4 KiB

Quantile statistics

Minimum-162480.3129
5-th percentile-27.09550743
Q146.31447739
median157.5178626
Q3304.0492356
95-th percentile493.3347143
Maximum2850.657851
Range165330.9707
Interquartile range (IQR)257.7347582

Descriptive statistics

Standard deviation3726.311048
Coefficient of variation (CV)-75.30340423
Kurtosis651.4684634
Mean-49.48396538
Median Absolute Deviation (MAD)123.2630787
Skewness-22.49707985
Sum-711975.2939
Variance13885394.03
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
-65.98878581< 0.1%
 
29.201332431< 0.1%
 
332.40816021< 0.1%
 
148.15786091< 0.1%
 
94.271613921< 0.1%
 
360.37064831< 0.1%
 
73.801659661< 0.1%
 
203.66511151< 0.1%
 
24.462014141< 0.1%
 
373.1878251< 0.1%
 
Other values (14378)1437899.9%
 
ValueCountFrequency (%) 
-162480.31291< 0.1%
 
-124965.83361< 0.1%
 
-119741.31421< 0.1%
 
-117100.27251< 0.1%
 
-94136.754261< 0.1%
 
ValueCountFrequency (%) 
2850.6578511< 0.1%
 
2205.0086771< 0.1%
 
1993.973771< 0.1%
 
1956.2674071< 0.1%
 
1868.5673961< 0.1%
 

params1
Real number (ℝ≥0)

SKEWED

Distinct count14326
Unique (%)99.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean174344.35260081355
Minimum1.2464153485861729e-29
Maximum507377792.1048709
Zeros0
Zeros (%)0.0%
Memory size112.4 KiB

Quantile statistics

Minimum1.246415349e-29
5-th percentile1e-10
Q11080.285552
median7001.471568
Q324400.06786
95-th percentile85043.32888
Maximum507377792.1
Range507377792.1
Interquartile range (IQR)23319.7823

Descriptive statistics

Standard deviation6016008.027
Coefficient of variation (CV)34.50646916
Kurtosis4568.134139
Mean174344.3526
Median Absolute Deviation (MAD)6751.647965
Skewness63.68227615
Sum2508466545
Variance3.619235258e+13
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1e-10630.4%
 
953.58102531< 0.1%
 
888.42445021< 0.1%
 
52.860284311< 0.1%
 
6218.3229161< 0.1%
 
42281.714151< 0.1%
 
327.68925151< 0.1%
 
82265.815291< 0.1%
 
9633.7325531< 0.1%
 
19067.914911< 0.1%
 
Other values (14316)1431699.5%
 
ValueCountFrequency (%) 
1.246415349e-291< 0.1%
 
3.967801941e-241< 0.1%
 
6.255908306e-231< 0.1%
 
3.44452566e-211< 0.1%
 
5.262123287e-211< 0.1%
 
ValueCountFrequency (%) 
507377792.11< 0.1%
 
329748667.91< 0.1%
 
252597046.81< 0.1%
 
251951190.41< 0.1%
 
59014441.21< 0.1%
 

params2
Real number (ℝ≥0)

Distinct count14203
Unique (%)98.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1245.5799411067878
Minimum1000.0
Maximum1599.9999999999998
Zeros0
Zeros (%)0.0%
Memory size112.4 KiB

Quantile statistics

Minimum1000
5-th percentile1076.305323
Q11090.07876
median1231.312356
Q31362.177354
95-th percentile1515.033737
Maximum1600
Range600
Interquartile range (IQR)272.0985933

Descriptive statistics

Standard deviation150.7385884
Coefficient of variation (CV)0.1210187989
Kurtosis-0.8645515764
Mean1245.579941
Median Absolute Deviation (MAD)138.4313872
Skewness0.413404394
Sum17921404.19
Variance22722.12203
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1597710.5%
 
1003600.4%
 
100080.1%
 
16007< 0.1%
 
1289.0664< 0.1%
 
10004< 0.1%
 
1142.9063< 0.1%
 
1165.923< 0.1%
 
1286.9833< 0.1%
 
1219.2043< 0.1%
 
Other values (14193)1422298.8%
 
ValueCountFrequency (%) 
10004< 0.1%
 
10001< 0.1%
 
10001< 0.1%
 
100080.1%
 
10001< 0.1%
 
ValueCountFrequency (%) 
16003< 0.1%
 
16001< 0.1%
 
16001< 0.1%
 
16001< 0.1%
 
16001< 0.1%
 

params3
Real number (ℝ≥0)

SKEWED

Distinct count13269
Unique (%)92.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.595020055030526
Minimum0.5000000000000001
Maximum4940.099845242949
Zeros0
Zeros (%)0.0%
Memory size112.4 KiB

Quantile statistics

Minimum0.5
5-th percentile0.5
Q12
median4.588196205
Q35.908641985
95-th percentile14.79655756
Maximum4940.099845
Range4939.599845
Interquartile range (IQR)3.908641985

Descriptive statistics

Standard deviation63.14547827
Coefficient of variation (CV)9.57472119
Kurtosis4855.784601
Mean6.595020055
Median Absolute Deviation (MAD)2.588196205
Skewness68.59503099
Sum94889.14855
Variance3987.351426
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
27094.9%
 
0.52852.0%
 
0.5120.1%
 
0.5110.1%
 
0.590.1%
 
1090.1%
 
0.580.1%
 
0.57< 0.1%
 
0.56< 0.1%
 
0.56< 0.1%
 
Other values (13259)1332692.6%
 
ValueCountFrequency (%) 
0.52852.0%
 
0.590.1%
 
0.5120.1%
 
0.55< 0.1%
 
0.5110.1%
 
ValueCountFrequency (%) 
4940.0998451< 0.1%
 
4303.511951< 0.1%
 
3655.7708381< 0.1%
 
733.85332491< 0.1%
 
278.28166121< 0.1%
 

params4
Real number (ℝ≥0)

SKEWED

Distinct count14201
Unique (%)98.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean924154.1468560996
Minimum8.1584815152519085e-19
Maximum1283687252.4541855
Zeros0
Zeros (%)0.0%
Memory size112.4 KiB

Quantile statistics

Minimum8.158481515e-19
5-th percentile5e-13
Q110601.72245
median21770.23528
Q343253.49235
95-th percentile122748.5237
Maximum1283687252
Range1283687252
Interquartile range (IQR)32651.7699

Descriptive statistics

Standard deviation19199989.3
Coefficient of variation (CV)20.77574328
Kurtosis2321.441145
Mean924154.1469
Median Absolute Deviation (MAD)13993.54673
Skewness43.40645376
Sum1.329672986e+10
Variance3.686395893e+14
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
5e-131881.3%
 
36252.343441< 0.1%
 
131977.46041< 0.1%
 
6730.4855181< 0.1%
 
12076.937921< 0.1%
 
11103.829591< 0.1%
 
41107.858651< 0.1%
 
19217.003081< 0.1%
 
21468.150971< 0.1%
 
41447.298331< 0.1%
 
Other values (14191)1419198.6%
 
ValueCountFrequency (%) 
8.158481515e-191< 0.1%
 
3.1323902e-181< 0.1%
 
4.074092224e-181< 0.1%
 
4.742827243e-181< 0.1%
 
6.693133025e-181< 0.1%
 
ValueCountFrequency (%) 
12836872521< 0.1%
 
989515634.61< 0.1%
 
819365760.11< 0.1%
 
717043701.91< 0.1%
 
573587229.31< 0.1%
 

params5
Real number (ℝ≥0)

Distinct count14171
Unique (%)98.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1249.4005912290147
Minimum1000.0
Maximum1599.9999999999998
Zeros0
Zeros (%)0.0%
Memory size112.4 KiB

Quantile statistics

Minimum1000
5-th percentile1003.216973
Q11089.009329
median1232.251148
Q31366.971049
95-th percentile1596.997198
Maximum1600
Range600
Interquartile range (IQR)277.9617196

Descriptive statistics

Standard deviation164.0534498
Coefficient of variation (CV)0.1313057245
Kurtosis-0.747870197
Mean1249.400591
Median Absolute Deviation (MAD)142.5137665
Skewness0.4621251171
Sum17976375.71
Variance26913.5344
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1000670.5%
 
1600550.4%
 
1597400.3%
 
1003300.2%
 
1597180.1%
 
10036< 0.1%
 
16005< 0.1%
 
16004< 0.1%
 
1362.6254411< 0.1%
 
1406.7031451< 0.1%
 
Other values (14161)1416198.4%
 
ValueCountFrequency (%) 
10001< 0.1%
 
10001< 0.1%
 
10001< 0.1%
 
10001< 0.1%
 
10001< 0.1%
 
ValueCountFrequency (%) 
16004< 0.1%
 
16001< 0.1%
 
16001< 0.1%
 
16001< 0.1%
 
1600550.4%
 

params6
Real number (ℝ≥0)

Distinct count14382
Unique (%)> 99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean88242.70284876494
Minimum0.5000000231602091
Maximum4883344.165783549
Zeros0
Zeros (%)0.0%
Memory size112.4 KiB

Quantile statistics

Minimum0.5000000232
5-th percentile5.48301671
Q17.953383392
median11.3821437
Q316.96238096
95-th percentile81697.06043
Maximum4883344.166
Range4883343.666
Interquartile range (IQR)9.008997568

Descriptive statistics

Standard deviation523558.8148
Coefficient of variation (CV)5.933168386
Kurtosis40.28262086
Mean88242.70285
Median Absolute Deviation (MAD)3.900304466
Skewness6.411890542
Sum1269636009
Variance2.741138326e+11
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
107< 0.1%
 
13.530513251< 0.1%
 
13.690647841< 0.1%
 
2933.3473071< 0.1%
 
11.208381871< 0.1%
 
14.813644591< 0.1%
 
11.437543781< 0.1%
 
26.31474061< 0.1%
 
10.913547641< 0.1%
 
24.795554131< 0.1%
 
Other values (14372)1437299.9%
 
ValueCountFrequency (%) 
0.50000002321< 0.1%
 
0.50000773611< 0.1%
 
0.50054360721< 0.1%
 
0.62507194371< 0.1%
 
0.69473313941< 0.1%
 
ValueCountFrequency (%) 
4883344.1661< 0.1%
 
4689999.771< 0.1%
 
4002886.9881< 0.1%
 
4001311.6031< 0.1%
 
4001233.0491< 0.1%
 

rms
Real number (ℝ≥0)

UNIQUE

Distinct count14388
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.788097914002476
Minimum6.110799020422154
Maximum212.5098034623708
Zeros0
Zeros (%)0.0%
Memory size112.4 KiB

Quantile statistics

Minimum6.11079902
5-th percentile7.241705084
Q18.289552626
median9.822982525
Q313.8692117
95-th percentile28.13839411
Maximum212.5098035
Range206.3990044
Interquartile range (IQR)5.579659073

Descriptive statistics

Standard deviation8.394044549
Coefficient of variation (CV)0.6563950797
Kurtosis48.21474878
Mean12.78809791
Median Absolute Deviation (MAD)1.974224635
Skewness4.768462979
Sum183995.1528
Variance70.4599839
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
10.198017411< 0.1%
 
10.28930171< 0.1%
 
7.2616567521< 0.1%
 
8.3737383141< 0.1%
 
8.86797841< 0.1%
 
19.618487081< 0.1%
 
14.465048011< 0.1%
 
19.155972461< 0.1%
 
6.8537131221< 0.1%
 
21.842008191< 0.1%
 
Other values (14378)1437899.9%
 
ValueCountFrequency (%) 
6.110799021< 0.1%
 
6.1384890531< 0.1%
 
6.2299447151< 0.1%
 
6.2751241181< 0.1%
 
6.2973440121< 0.1%
 
ValueCountFrequency (%) 
212.50980351< 0.1%
 
133.85520831< 0.1%
 
129.38126031< 0.1%
 
114.83380411< 0.1%
 
109.19491511< 0.1%
 

beta
Real number (ℝ≥0)

Distinct count12920
Unique (%)89.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.3398781139764335
Minimum3.3369076773278623e-37
Maximum1.0
Zeros0
Zeros (%)0.0%
Memory size112.4 KiB

Quantile statistics

Minimum3.336907677e-37
5-th percentile2.397680408e-15
Q10.04737269486
median0.2307974401
Q30.546055588
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)0.4986828931

Descriptive statistics

Standard deviation0.3389495033
Coefficient of variation (CV)0.9972678125
Kurtosis-0.6959015507
Mean0.339878114
Median Absolute Deviation (MAD)0.2160081881
Skewness0.804329633
Sum4890.166304
Variance0.1148867658
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
113879.6%
 
1280.2%
 
1180.1%
 
1150.1%
 
1120.1%
 
15< 0.1%
 
14< 0.1%
 
13< 0.1%
 
13< 0.1%
 
12< 0.1%
 
Other values (12910)1291189.7%
 
ValueCountFrequency (%) 
3.336907677e-371< 0.1%
 
5.533556644e-331< 0.1%
 
1.262773621e-301< 0.1%
 
3.481021966e-301< 0.1%
 
6.42219085e-301< 0.1%
 
ValueCountFrequency (%) 
113879.6%
 
1280.2%
 
1180.1%
 
15< 0.1%
 
1150.1%
 

intensity_max
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count5018
Unique (%)34.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3309.3096334445368
Minimum1059.0
Maximum58900.0
Zeros0
Zeros (%)0.0%
Memory size112.4 KiB

Quantile statistics

Minimum1059
5-th percentile1369.35
Q11874
median2543.5
Q33626
95-th percentile7498.95
Maximum58900
Range57841
Interquartile range (IQR)1752

Descriptive statistics

Standard deviation2974.185456
Coefficient of variation (CV)0.8987329037
Kurtosis72.55313644
Mean3309.309633
Median Absolute Deviation (MAD)784.5
Skewness6.475445569
Sum47614347.01
Variance8845779.128
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1951160.1%
 
1827150.1%
 
1921140.1%
 
1754140.1%
 
1540140.1%
 
2538130.1%
 
2129130.1%
 
2120130.1%
 
1759130.1%
 
1949130.1%
 
Other values (5008)1425099.0%
 
ValueCountFrequency (%) 
10592< 0.1%
 
10611< 0.1%
 
10661< 0.1%
 
10721< 0.1%
 
10732< 0.1%
 
ValueCountFrequency (%) 
589001< 0.1%
 
587871< 0.1%
 
555681< 0.1%
 
532721< 0.1%
 
496081< 0.1%
 

intensity_min
Real number (ℝ)

Distinct count695
Unique (%)4.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-194.71717170246038
Minimum-520.0
Maximum598.0
Zeros3
Zeros (%)< 0.1%
Memory size112.4 KiB

Quantile statistics

Minimum-520
5-th percentile-325
Q1-262
median-214
Q3-146
95-th percentile-2
Maximum598
Range1118
Interquartile range (IQR)116

Descriptive statistics

Standard deviation101.1643417
Coefficient of variation (CV)-0.5195450448
Kurtosis1.834587134
Mean-194.7171717
Median Absolute Deviation (MAD)56
Skewness1.03521412
Sum-2801590.666
Variance10234.22403
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
-230930.6%
 
-243880.6%
 
-233870.6%
 
-238870.6%
 
-222840.6%
 
-226830.6%
 
-260820.6%
 
-225820.6%
 
-254820.6%
 
-229810.6%
 
Other values (685)1353994.1%
 
ValueCountFrequency (%) 
-5201< 0.1%
 
-4981< 0.1%
 
-4931< 0.1%
 
-4861< 0.1%
 
-4731< 0.1%
 
ValueCountFrequency (%) 
5981< 0.1%
 
3811< 0.1%
 
3641< 0.1%
 
3551< 0.1%
 
3401< 0.1%
 

intensity_mean
Real number (ℝ)

HIGH CORRELATION

Distinct count14382
Unique (%)> 99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean324.6632990463718
Minimum-4.508463605468748
Maximum2339.933897847358
Zeros0
Zeros (%)0.0%
Memory size112.4 KiB

Quantile statistics

Minimum-4.508463605
5-th percentile82.71563586
Q1161.7752821
median255.4607205
Q3416.1479317
95-th percentile805.9454887
Maximum2339.933898
Range2344.442361
Interquartile range (IQR)254.3726496

Descriptive statistics

Standard deviation235.602061
Coefficient of variation (CV)0.7256812265
Kurtosis4.797245236
Mean324.663299
Median Absolute Deviation (MAD)112.5761718
Skewness1.810308994
Sum4671255.547
Variance55508.33117
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
260.79452052< 0.1%
 
203.04882812< 0.1%
 
273.09092892< 0.1%
 
104.66688362< 0.1%
 
231.22070312< 0.1%
 
137.11523442< 0.1%
 
297.76081761< 0.1%
 
300.4233531< 0.1%
 
402.00434031< 0.1%
 
491.59548611< 0.1%
 
Other values (14372)1437299.9%
 
ValueCountFrequency (%) 
-4.5084636051< 0.1%
 
11.789496481< 0.1%
 
12.288628471< 0.1%
 
13.042751741< 0.1%
 
17.355468871< 0.1%
 
ValueCountFrequency (%) 
2339.9338981< 0.1%
 
2176.256511< 0.1%
 
2091.2196121< 0.1%
 
2090.4757561< 0.1%
 
2055.6894531< 0.1%
 

intensity_std
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct count14388
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean451.5886300826231
Minimum135.01688569310494
Maximum7724.809227363108
Zeros0
Zeros (%)0.0%
Memory size112.4 KiB

Quantile statistics

Minimum135.0168857
5-th percentile195.0610229
Q1268.8863562
median359.2782582
Q3498.3567896
95-th percentile1006.917582
Maximum7724.809227
Range7589.792342
Interquartile range (IQR)229.4704334

Descriptive statistics

Standard deviation365.0644098
Coefficient of variation (CV)0.8084003571
Kurtosis62.42204577
Mean451.5886301
Median Absolute Deviation (MAD)105.1197064
Skewness5.830788673
Sum6497457.21
Variance133272.0233
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1272.3254571< 0.1%
 
514.39625771< 0.1%
 
538.5772221< 0.1%
 
263.72527961< 0.1%
 
521.89057891< 0.1%
 
374.14320841< 0.1%
 
272.7321331< 0.1%
 
515.48005611< 0.1%
 
264.79655521< 0.1%
 
263.76811871< 0.1%
 
Other values (14378)1437899.9%
 
ValueCountFrequency (%) 
135.01688571< 0.1%
 
135.51444481< 0.1%
 
138.98186371< 0.1%
 
139.99652361< 0.1%
 
140.86634511< 0.1%
 
ValueCountFrequency (%) 
7724.8092271< 0.1%
 
7379.8340491< 0.1%
 
6683.5987681< 0.1%
 
6398.1230751< 0.1%
 
6292.0597541< 0.1%
 

intensity_sum
Real number (ℝ)

HIGH CORRELATION

Distinct count14377
Unique (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean166114.79184404493
Minimum-2308.333365999999
Maximum1195706.2218
Zeros0
Zeros (%)0.0%
Memory size112.4 KiB

Quantile statistics

Minimum-2308.333366
5-th percentile42350.40556
Q182814.27777
median130657.8333
Q3212982.8056
95-th percentile411889.4724
Maximum1195706.222
Range1198014.555
Interquartile range (IQR)130168.5278

Descriptive statistics

Standard deviation120514.0318
Coefficient of variation (CV)0.7254864571
Kurtosis4.799542562
Mean166114.7918
Median Absolute Deviation (MAD)57571.83327
Skewness1.810653528
Sum2390059625
Variance1.452363186e+10
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1183852< 0.1%
 
702032< 0.1%
 
1203112< 0.1%
 
1039612< 0.1%
 
1332662< 0.1%
 
139822.55562< 0.1%
 
159302.55562< 0.1%
 
946902< 0.1%
 
1077512< 0.1%
 
682732< 0.1%
 
Other values (14367)1436899.9%
 
ValueCountFrequency (%) 
-2308.3333661< 0.1%
 
6036.22221< 0.1%
 
6291.7777761< 0.1%
 
6677.888891< 0.1%
 
8886.000061< 0.1%
 
ValueCountFrequency (%) 
1195706.2221< 0.1%
 
1114243.3331< 0.1%
 
1068613.2221< 0.1%
 
1068233.1111< 0.1%
 
10525131< 0.1%
 

intensity_amplitude_v
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count5126
Unique (%)35.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3504.0268051469975
Minimum1091.0
Maximum59209.0
Zeros0
Zeros (%)0.0%
Memory size112.4 KiB

Quantile statistics

Minimum1091
5-th percentile1567.35
Q12071
median2733.5
Q33814
95-th percentile7725.65
Maximum59209
Range58118
Interquartile range (IQR)1743

Descriptive statistics

Standard deviation2978.62227
Coefficient of variation (CV)0.8500569304
Kurtosis72.53977021
Mean3504.026805
Median Absolute Deviation (MAD)777.5
Skewness6.478267133
Sum50415937.67
Variance8872190.628
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1979150.1%
 
1996140.1%
 
2745140.1%
 
1916130.1%
 
2046130.1%
 
1864130.1%
 
2680130.1%
 
2196130.1%
 
2559120.1%
 
1852120.1%
 
Other values (5116)1425699.1%
 
ValueCountFrequency (%) 
10911< 0.1%
 
11131< 0.1%
 
11511< 0.1%
 
11641< 0.1%
 
12012< 0.1%
 
ValueCountFrequency (%) 
592091< 0.1%
 
589911< 0.1%
 
557391< 0.1%
 
536801< 0.1%
 
498731< 0.1%
 

type
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size112.4 KiB
0
7436
1
6952
ValueCountFrequency (%) 
0743651.7%
 
1695248.3%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

df_indexspectrum_idspectrum_filenamechip_idexc_wllayout_alayout_xlayout_ypos_xparams0params1params2params3params4params5params6rmsbetaintensity_maxintensity_minintensity_meanintensity_stdintensity_sumintensity_amplitude_vtype
00000da4633378740f1ee8b2e223339f4abce9b400.dat111361401313.08130.809581.1801037.7151.53122469.6521032.3178.29610.0290.0251751.000-228.00040.293172.20720629.8891979.0000
11000ed1a5a9fe0ad2b7dde2f150a503244145e7ce.dat1220168159.41591.30117405.8201080.5104.76633257.1231077.4698.0187.9480.3444219.000-263.000166.959463.42885483.0004482.0000
220016e3322c4ce0700f9a3d58b7ccaee157979cf0.dat2233429-610.769106.6430.0001119.4642.00042579.8681378.88311.68710.7400.0002412.000-235.000151.578327.85877607.7782647.0000
3300256bd0f8c6cf5f59c8ed3641184d3b7c0ae703.dat221321391214.618306.93410994.8651139.8555.19939349.7421145.2139.44510.3800.2183209.000-52.000523.081436.481267817.4453261.0000
44003483ee5ae313d375904c63418d39f86dfab9bb.dat2244585-257.61646.13322276.2191120.9185.66831054.9291117.1087.6598.3170.4183998.000-245.000138.188472.01070752.1114243.0000
550037f18f5aaec409bef7cf8657e1503943995dbe.dat3222650988.16816.5970.0001090.0152.00034322.9241217.30611.5527.7520.0001894.000-285.00089.497257.85445822.2222179.0000
660041c80d2bb0ad8f0c1e7a226cdd3178326a87f1.dat42447118-190.857-61.5815348.6511288.5791.211233242.0281285.6099.85816.3930.02214818.000-212.000441.1061815.520225846.11115030.0000
770046ee889a24134bd351bb142787b681af27ac86.dat5126118355.043165.875727.2071285.7741.01854186.2431281.5939.07316.6230.0133934.000-231.000204.117481.299104507.8894165.0000
880050161961b3e2e398de662d2f3542adf199f3a2.dat6212360899.42857.5595012.3011512.4804.76027461.4001506.01714.16411.0900.1541608.000-261.00084.236244.68943129.0001869.0000
99005c0d491a359ccff5832c576da41d645d597091.dat5112268865.548-15.4580.0001126.9762.000123489.2181430.43614.7658.6470.0005291.000-253.000250.787751.088128402.7785544.0000

Last rows

df_indexspectrum_idspectrum_filenamechip_idexc_wllayout_alayout_xlayout_ypos_xparams0params1params2params3params4params5params6rmsbetaintensity_maxintensity_minintensity_meanintensity_stdintensity_sumintensity_amplitude_vtype
143786942ffb380926d97f0c17e6ecf32faa5f7fd96ab9b4c.dat91133471244.485210.0415958.1781089.6512.9696286.5721086.7005.1757.9170.4871835.000-37.000282.030177.966144399.3331872.0001
143796943ffb8e22c7abd6d35da709418b8210c2e55cf2af9.dat821341841247.067145.5156244.9921502.8362.96319657.4821505.07911.50411.7700.2416326.000-237.000396.640928.964202683.0006563.0001
143806944ffd320b432ea04daf25bab29262a63382cb75f4e.dat91339183-448.015460.70719726.3331088.9374.05027528.5921087.7609.5569.9760.4175143.00021.000651.312525.208333471.6675122.0001
143816945ffd5a8c80f8d30ad37f83528d345a9fef49494a9.dat92335142-576.8681038.004232553.1261284.72611.8100.0001599.994232556.73697.9541.00012623.000-195.000870.7471985.032445822.66712818.0001
143826946ffdd738462bbf44c87318e75b09f759be1433b58.dat814347-609.691123.4191064.0901086.3860.50010795.0841086.4956.6919.7440.0901656.000-211.000250.816272.447128166.7781867.0001
143836947ffe3f18bccea9eca0c4ba9309e1b871e8089dedb.dat8122114220.997150.3511575.7441087.5300.50011981.8321085.8645.7409.8930.1161650.000-148.000327.154247.617167175.4451798.0001
143846948ffe5dc9b0008f1686fbb01d6b771f9b18d2c8be5.dat92216181702.840285.0455157.4881153.6984.53416558.3251149.5586.34719.9130.2372159.000-100.000371.267379.261190088.8892259.0001
143856949ffe99ef3b8a4ffb5cbfd6dc212d4616d7e28ac68.dat8232539-897.361201.5951298.7871137.0384.39848307.4461162.60010.3058.3470.0263164.000-183.000267.291383.777136585.8893347.0001
143866950fff6557194ea0487af9273db945d1ec8d0d97b51.dat924395-1599.428986.4161536.9741155.2522.43814780.3911154.47917.3619.9720.0941994.00025.000951.654370.822487246.8891969.0001
143876951ffffb084eeba6fd04e59d147cb4379e428a08ebf.dat81228101057.12848.644446.1581552.5150.50054420.9811559.17714.4789.8810.0082420.000-204.000141.838334.49672479.3332624.0001